Data Cleaning: Detecting, Diagnosing, and Editing Data Abnormalities
نویسندگان
چکیده
منابع مشابه
Data Cleaning: Detecting, Diagnosing, and Editing Data Abnormalities
I n clinical epidemiological research, errors occur in spite of careful study design, conduct, and implementation of error-prevention strategies. Data cleaning intends to identify and correct these errors or at least to minimize their impact on study results. Little guidance is currently available in the peer-reviewed literature on how to set up and carry out cleaning efforts in an effi cient a...
متن کاملDiscovering Editing Rules For Data Cleaning
Dirty data continues to be an important issue for companies. The database community pays a particular attention to this subject. A variety of integrity constraints like Conditional Functional Dependencies (CFD) have been studied for data cleaning. Data repair methods based on these constraints are strong to detect inconsistencies but are limited on how to correct data, worse they can even intro...
متن کاملEditing Rules: Discovery and Application to Data Cleaning
Dirty data is a serious problem for businesses, leading to incorrect decision making, inefficient daily operations, and ultimately wasting both time and money. A variety of integrity constraints like Conditional Functional Dependencies (CFD) have been studied for data cleaning. Data repairing methods based on these constraints are strong to detect inconsistencies but are limited on how to corre...
متن کاملA Domain-Independent Data Cleaning Algorithm for Detecting Similar-Duplicates
Data mining algorithms generally assume that data will be clean and consistent. However, in practice, this is not always the case, and for this reason the detection and elimination of duplicate records is an important part of data cleaning. The presence of similar-duplicate records causes over-representation of data. If the database contains different representations of the same data, the resul...
متن کاملExploratory Data Mining and Data Cleaning
It sounds good when knowing the exploratory data mining and data cleaning in this website. This is one of the books that many people looking for. In the past, many people ask about this book as their favourite book to read and collect. And now, we present hat you need quickly. It seems to be so happy to offer you this famous book. It will not become a unity of the way for you to get amazing ben...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLoS Medicine
سال: 2005
ISSN: 1549-1676
DOI: 10.1371/journal.pmed.0020267